Revisiting Near-collinearity: Numerical vs. Statistical Perspectives

نویسنده

  • Aris Spanos
چکیده

The primary objective is to bring out and discuss the issues raised by the differences between the numerical and the statistical perspective on nearcollinearity in the context of the Linear Regression model. The numerical perspective views the problem as stemming from the ill-conditioning of the (X|X) matrix, which does not necessarily originate in high sample correlations among the regressors. The latter, however, constitutes the cornerstone of the statistical perspective. The numerical measures, such as the condition number, are applicable to an arbitrary matrix of numbers, irrespective of whether these numbers denote statistical data or not. On the other hand, the various statistical measures invoke probabilistic assumptions pertaining to the stochastic process underlying the particular data. When these assumptions are invalid, the statistical measures can be highly misleading. The paper argues that these differences between the numerical and statistical measures of ill-conditioning have important implications that call into question certain aspects of the current conventional wisdom on near-collinearity. The proposed perspective views the near-collinearity as data inadequacy in learning about the parameters of interest. This perspective is used to call into question the pertinence of VIFs and shed light on certain issues raised by the literature on centering the data.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Regression on Manifolds : Estimation of the Exterior Derivative

Collinearity and near-collinearity of predictors cause difficulties when doing regression. In these cases, variable selection becomes untenable because of mathematical issues concerning the existence and numerical stability of the regression coefficients, and interpretation of the coefficients is ambiguous because gradients are not defined. Using a differential geometric interpretation, in whic...

متن کامل

Numerical conditioning and asymptotic variance of subspace estimates

New formulas for the asymptotic variance of the parameter estimates in subspace identi)cation, show that the accuracy of the parameter estimates depends on certain indices of ‘near collinearity’ of the state and future input subspaces of the system to be identi)ed. This complements the numerical conditioning analysis of subspace methods presented in the companion paper (On the ill-conditioning ...

متن کامل

Confronting Multicollinearity in Ecological Multiple Regression

The natural complexity of ecological communities regularly lures ecologists to collect elaborate data sets in which confounding factors are often present. Although multiple regression is commonly used in such cases to test the individual effects of many explanatory variables on a continuous response, the inherent collinearity (multicollinearity) of confounded explanatory variables encumbers ana...

متن کامل

Identifying and removing sources of imprecision in polynomial regression

Identification and removal of imprecision in polynomial regression, originating from random errors (noise) in the independent variable data is discussed. The truncation error-to-noise ratio (TNR) is used to discriminate between imprecision dominated by collinearity, or numerical error propagation, or inflated variance due to noise in the independent variable. It is shown that after the source o...

متن کامل

Lateral Collinearity and Misleading Results in Variance-Based SEM: An Illustration and Recommendations

Research Practice Ned Kock Texas A&M International University [email protected] Gary S. Lynn Stevens Institute of Technology [email protected] Variance-based structural equation modeling is extensively used in information systems research, and many related findings may have been distorted by hidden collinearity. This is a problem that may extend to multivariate analyses, in general, in the fiel...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016